An improved wavelet-based dereverberation for robust automatic speech recognition
نویسندگان
چکیده
This paper presents an improved wavelet-based dereverberation method for automatic speech recognition (ASR). Dereverberation is based on filtering reverberant wavelet coefficients with the Wiener gains to suppress the effect of the late reflections. Optimization of the wavelet parameters using acoustic model enables the system to estimate the clean speech and late reflections effectively. This results to a better estimate of the Wiener gains for dereverberation in the ASR application. Additional tuning of the parameters of the Wiener gain in relation with the acoustic model further improves the dereverberation process for ASR. In the experiment with real reverberant data, we have achieved a significant improvement in ASR accuracy.
منابع مشابه
Robust Speech Recognition Using Optimized Wavelet Filtering in Reverberant Conditions
Speech recognition in reverberant environments is a difficult task. Reverberation has the effect of degradation of recognition performance due to acoustic mismatch. We present an optimization method of the wavelet parameters for dereverberation in automatic speech recognition (ASR). By tuning the wavelet parameters to improve the acoustic model likelihood, waveletbased dereverberation methods b...
متن کاملOptimizing Wavelet Parameters for Dereverberation in Automatic Speech Recognition
We present an optimization method of the wavelet parameters for dereverberation in automatic speech recognition (ASR). By tuning the wavelet parameters to improve the acoustic model likelihood, wavelet-based dereverberation methods become more effective in the ASR application. We evaluate several existing wavelet-based methods and optimize them, based on our proposed scheme. Experimental evalua...
متن کاملDereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
This paper describes a multiple-resolution signal analysis to suppress late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution to discriminate the late reflection subspace from the speech subspace. By selecting appropriate wavelet basis in the WPT for speech and late reflection, we can effectively estimate...
متن کاملA Simplified Decoding Method for a Robust Distant-talking Asr Concept Based on Feature-domain Dereverberation
A simplified decoding method for the concept of REverberation MOdeling for Speech recognition (REMOS) [1] is proposed. In order to achieve robust distant-talking Automatic Speech Recognition (ASR), the REMOS concept uses a combination of clean-speech HMMs and a reverberation model to perform feature-domain dereverberation during decoding. The simplified decoding/dereverberation method proposed ...
متن کاملSpeech Recognition by Dereverberation Method Based on Multi-channel LMS Algorithm in Noisy Reverberant Environment
1 Introduction In a distant-talking environment, channel distortion drastically degrades speech recognition performance because of mismatches between the training and test environments. The current approaches focusing on robustness issues for automatic speech recognition (ASR) in noisy reverberant environments can be classified as speech enhancement, robust feature extraction, or model adaptati...
متن کامل